28 resultados para Transcriptome

em Duke University


Relevância:

20.00% 20.00%

Publicador:

Resumo:

BACKGROUND: There is considerable interest in the development of methods to efficiently identify all coding variants present in large sample sets of humans. There are three approaches possible: whole-genome sequencing, whole-exome sequencing using exon capture methods, and RNA-Seq. While whole-genome sequencing is the most complete, it remains sufficiently expensive that cost effective alternatives are important. RESULTS: Here we provide a systematic exploration of how well RNA-Seq can identify human coding variants by comparing variants identified through high coverage whole-genome sequencing to those identified by high coverage RNA-Seq in the same individual. This comparison allowed us to directly evaluate the sensitivity and specificity of RNA-Seq in identifying coding variants, and to evaluate how key parameters such as the degree of coverage and the expression levels of genes interact to influence performance. We find that although only 40% of exonic variants identified by whole genome sequencing were captured using RNA-Seq; this number rose to 81% when concentrating on genes known to be well-expressed in the source tissue. We also find that a high false positive rate can be problematic when working with RNA-Seq data, especially at higher levels of coverage. CONCLUSIONS: We conclude that as long as a tissue relevant to the trait under study is available and suitable quality control screens are implemented, RNA-Seq is a fast and inexpensive alternative approach for finding coding variants in genes with sufficiently high expression levels.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Cryptococcus neoformans is a pathogenic basidiomycetous yeast responsible for more than 600,000 deaths each year. It occurs as two serotypes (A and D) representing two varieties (i.e. grubii and neoformans, respectively). Here, we sequenced the genome and performed an RNA-Seq-based analysis of the C. neoformans var. grubii transcriptome structure. We determined the chromosomal locations, analyzed the sequence/structural features of the centromeres, and identified origins of replication. The genome was annotated based on automated and manual curation. More than 40,000 introns populating more than 99% of the expressed genes were identified. Although most of these introns are located in the coding DNA sequences (CDS), over 2,000 introns in the untranslated regions (UTRs) were also identified. Poly(A)-containing reads were employed to locate the polyadenylation sites of more than 80% of the genes. Examination of the sequences around these sites revealed a new poly(A)-site-associated motif (AUGHAH). In addition, 1,197 miscRNAs were identified. These miscRNAs can be spliced and/or polyadenylated, but do not appear to have obvious coding capacities. Finally, this genome sequence enabled a comparative analysis of strain H99 variants obtained after laboratory passage. The spectrum of mutations identified provides insights into the genetics underlying the micro-evolution of a laboratory strain, and identifies mutations involved in stress responses, mating efficiency, and virulence.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

Olfactory sensory neurons (OSNs), which detect a myriad of odorants, are known to express one allele of one olfactory receptor (OR) gene (Olfr) from the largest gene family in the mammalian genome. The OSNs expressing the same OR project their axons to the main olfactory bulb where they converge to form glomeruli. This “One neuron-one receptor rule” makes the olfactory epithelium (OE), which consists of a vast number of OSNs expressing unique ORs, one of the most heterogeneous cell populations. However, the mechanism of how the single OR allele is chosen remains unclear along with the question of whether one OSN only expresses a single OR gene, a hypothesis that has not been rigorously verified while we performed the experiments. Moreover, failure of axonal targeting to single glomerulus was observed in MeCP2 deficient OSNs where delayed development was proposed as an explanation for the phenotype. How Mecp2 mutation caused this aberrant targeting is not entirely understood.

In this dissertation, we explored the transcriptomes of single and mature OSNs by single-cell RNA-Seq to reveal their heterogeneity and further studied the OR gene expression from these isolated OSNs. The singularity of sequenced OSNs was ensured by the observation of monoallelic expression of X-linked genes from the hybrid samples from crosses between mice of different strains where strain-specific polymorphisms could be used to track the allelic origins of SNP-containing reads. The clustering of expression profiles from triplicates that originated from the same cell assured that the transcriptomic identities of OSNs were maintained through the experimental process. The average gene expression profiles of sequenced OSNs correlated well to the conventional transcriptome data of FACS-sorted Omp-positive cells, and the top-ranked expression of OR was conceded in the single-OSN transcriptomes. While exploring cellular diversity, in addition to OR genes, we revealed nearly 200 differentially expressed genes among the sequenced OSNs in this study. Among the 36 sequenced OSNs, eight cells (22.2%) showed multiple OR gene expression and the presences of additional ORs were not restricted to the neighbor loci that shared the transcriptional effect of the primary OR expression, suggesting that the “One neuron-one receptor rule” might not be strictly true at the transcription level. All of the inferable ORs, including additional co-expressed ORs, were shown to be monoallelic. Our sequencing of 21 Mecp2308 mutant OSNs, of which 62% expressed more than one OR genes, and the expression levels of the additional ORs were significantly higher than those in the wild-type, suggested that MeCP2 plays a role in the regulation of singular OR gene expression. Dual label in situ hybridization along with the sequence data revealed that dorsal and ventral ORs were co-expressed in the same Mecp2 mutant OSN, further implying that MeCP2 might be involved in regulation of OR territories in the OE. Our results suggested a new role of MeCP2 in OR gene choice and ratified that this multiple-OR expression caused by Mecp2 mutation did not accompany delayed OSN development that has been observed in the previous studies on the Mecp2 mutants.

Relevância:

20.00% 20.00%

Publicador:

Resumo:

The carotid body (CB) is a major arterial chemoreceptor containing glomus cells that are activated by changes in arterial blood contents including oxygen. Despite significant advancement in the characterization of their physiological properties, our understanding on the underlying molecular machinery and signaling pathway in CB glomus cells is still limited.

To overcome these limitations, in chapter 1, I demonstrated the first transcriptome profile of CB glomus cells using single cell sequencing technology, which allowed us to uncover a set of abundantly expressed genes, including novel glomus cell-specific transcripts. These results revealed involvement of G protein-coupled receptor (GPCR) signaling pathway, various types of ion channels, as well as atypical mitochondrial subunits in CB function. I also identified ligands for the mostly highly expressed GPCR (Olfr78) in CB glomus cells and examined this receptor’s role in CB mediated hypoxic ventilatory response.

Current knowledge of CB suggest glomus cells rely on unusual mitochondria for their sensitivity to hypoxia. I previously identified the atypical mitochondrial subunit Ndufa4l2 as a highly over-represented gene in CB glomus cells. In chapter 2, to investigate the functional significance of Ndufa4l2 in CB function, I phenotyped both Ndufa4l2 knockout mice and mice with conditional Ndufa4l2 deletion in CB glomus cells. I found that Ndufa4l2 is essential to the establishment of regular breathing after birth. Ablating Ndufa4l2 in postnatal CB glomus cells resulted in defective CB sensitivity to hypoxia as well as CB mediated hypoxic ventilatory response. Together, our data showed that Ndufa4l2 is critical to respiratory control and the oxygen sensitivity of CB glomus cells.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Although lactic acidosis is a prominent feature of solid tumors, we still have limited understanding of the mechanisms by which lactic acidosis influences metabolic phenotypes of cancer cells. We compared global transcriptional responses of breast cancer cells in response to three distinct tumor microenvironmental stresses: lactic acidosis, glucose deprivation, and hypoxia. We found that lactic acidosis and glucose deprivation trigger highly similar transcriptional responses, each inducing features of starvation response. In contrast to their comparable effects on gene expression, lactic acidosis and glucose deprivation have opposing effects on glucose uptake. This divergence of metabolic responses in the context of highly similar transcriptional responses allows the identification of a small subset of genes that are regulated in opposite directions by these two conditions. Among these selected genes, TXNIP and its paralogue ARRDC4 are both induced under lactic acidosis and repressed with glucose deprivation. This induction of TXNIP under lactic acidosis is caused by the activation of the glucose-sensing helix-loop-helix transcriptional complex MondoA:Mlx, which is usually triggered upon glucose exposure. Therefore, the upregulation of TXNIP significantly contributes to inhibition of tumor glycolytic phenotypes under lactic acidosis. Expression levels of TXNIP and ARRDC4 in human cancers are also highly correlated with predicted lactic acidosis pathway activities and associated with favorable clinical outcomes. Lactic acidosis triggers features of starvation response while activating the glucose-sensing MondoA-TXNIP pathways and contributing to the "anti-Warburg" metabolic effects and anti-tumor properties of cancer cells. These results stem from integrative analysis of transcriptome and metabolic response data under various tumor microenvironmental stresses and open new paths to explore how these stresses influence phenotypic and metabolic adaptations in human cancers.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

BACKGROUND: Since mature erythrocytes are terminally differentiated cells without nuclei and organelles, it is commonly thought that they do not contain nucleic acids. In this study, we have re-examined this issue by analyzing the transcriptome of a purified population of human mature erythrocytes from individuals with normal hemoglobin (HbAA) and homozygous sickle cell disease (HbSS). METHODS AND FINDINGS: Using a combination of microarray analysis, real-time RT-PCR and Northern blots, we found that mature erythrocytes, while lacking ribosomal and large-sized RNAs, contain abundant and diverse microRNAs. MicroRNA expression of erythrocytes was different from that of reticulocytes and leukocytes, and contributed the majority of the microRNA expression in whole blood. When we used microRNA microarrays to analyze erythrocytes from HbAA and HbSS individuals, we noted a dramatic difference in their microRNA expression pattern. We found that miR-320 played an important role for the down-regulation of its target gene, CD71 during reticulocyte terminal differentiation. Further investigation revealed that poor expression of miR-320 in HbSS cells was associated with their defective downregulation CD71 during terminal differentiation. CONCLUSIONS: In summary, we have discovered significant microRNA expression in human mature erythrocytes, which is dramatically altered in HbSS erythrocytes and their defect in terminal differentiation. Thus, the global analysis of microRNA expression in circulating erythrocytes can provide mechanistic insights into the disease phenotypes of erythrocyte diseases.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

A strand-specific transcriptome sequencing strategy, directional ligation sequencing or DeLi-seq, was employed to profile antisense transcriptome of Schizosaccharomyces pombe. Under both normal and heat shock conditions, we found that polyadenylated antisense transcripts are broadly expressed while distinct expression patterns were observed for protein-coding and non-coding loci. Dominant antisense expression is enriched in protein-coding genes involved in meiosis or stress response pathways. Detailed analyses further suggest that antisense transcripts are independently regulated with respect to their sense transcripts, and diverse mechanisms might be potentially involved in the biogenesis and degradation of antisense RNAs. Taken together, antisense transcription may have profound impacts on global gene regulation in S. pombe.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

There is great interindividual variability in HIV-1 viral setpoint after seroconversion, some of which is known to be due to genetic differences among infected individuals. Here, our focus is on determining, genome-wide, the contribution of variable gene expression to viral control, and to relate it to genomic DNA polymorphism. RNA was extracted from purified CD4+ T-cells from 137 HIV-1 seroconverters, 16 elite controllers, and 3 healthy blood donors. Expression levels of more than 48,000 mRNA transcripts were assessed by the Human-6 v3 Expression BeadChips (Illumina). Genome-wide SNP data was generated from genomic DNA using the HumanHap550 Genotyping BeadChip (Illumina). We observed two distinct profiles with 260 genes differentially expressed depending on HIV-1 viral load. There was significant upregulation of expression of interferon stimulated genes with increasing viral load, including genes of the intrinsic antiretroviral defense. Upon successful antiretroviral treatment, the transcriptome profile of previously viremic individuals reverted to a pattern comparable to that of elite controllers and of uninfected individuals. Genome-wide evaluation of cis-acting SNPs identified genetic variants modulating expression of 190 genes. Those were compared to the genes whose expression was found associated with viral load: expression of one interferon stimulated gene, OAS1, was found to be regulated by a SNP (rs3177979, p = 4.9E-12); however, we could not detect an independent association of the SNP with viral setpoint. Thus, this study represents an attempt to integrate genome-wide SNP signals with genome-wide expression profiles in the search for biological correlates of HIV-1 control. It underscores the paradox of the association between increasing levels of viral load and greater expression of antiviral defense pathways. It also shows that elite controllers do not have a fully distinctive mRNA expression pattern in CD4+ T cells. Overall, changes in global RNA expression reflect responses to viral replication rather than a mechanism that might explain viral control.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

This is a crucial transition time for human genetics in general, and for HIV host genetics in particular. After years of equivocal results from candidate gene analyses, several genome-wide association studies have been published that looked at plasma viral load or disease progression. Results from other studies that used various large-scale approaches (siRNA screens, transcriptome or proteome analysis, comparative genomics) have also shed new light on retroviral pathogenesis. However, most of the inter-individual variability in response to HIV-1 infection remains to be explained: genome resequencing and systems biology approaches are now required to progress toward a better understanding of the complex interactions between HIV-1 and its human host.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Transient overexpression of defined combinations of master regulator genes can effectively induce cellular reprogramming: the acquisition of an alternative predicted phenotype from a differentiated cell lineage. This can be of particular importance in cardiac regenerative medicine wherein the heart lacks the capacity to heal itself, but simultaneously contains a large pool of fibroblasts. In this study we determined the cardio-inducing capacity of ten transcription factors to actuate cellular reprogramming of mouse embryonic fibroblasts into cardiomyocyte-like cells. Overexpression of transcription factors MYOCD and SRF alone or in conjunction with Mesp1 and SMARCD3 enhanced the basal but necessary cardio-inducing effect of the previously reported GATA4, TBX5, and MEF2C. In particular, combinations of five or seven transcription factors enhanced the activation of cardiac reporter vectors, and induced an upregulation of cardiac-specific genes. Global gene expression analysis also demonstrated a significantly greater cardio-inducing effect when the transcription factors MYOCD and SRF were used. Detection of cross-striated cells was highly dependent on the cell culture conditions and was enhanced by the addition of valproic acid and JAK inhibitor. Although we detected Ca(2+) transient oscillations in the reprogrammed cells, we did not detect significant changes in resting membrane potential or spontaneously contracting cells. This study further elucidates the cardio-inducing effect of the transcriptional networks involved in cardiac cellular reprogramming, contributing to the ongoing rational design of a robust protocol required for cardiac regenerative therapies.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

There is great potential for host-based gene expression analysis to impact the early diagnosis of infectious diseases. In particular, the influenza pandemic of 2009 highlighted the challenges and limitations of traditional pathogen-based testing for suspected upper respiratory viral infection. We inoculated human volunteers with either influenza A (A/Brisbane/59/2007 (H1N1) or A/Wisconsin/67/2005 (H3N2)), and assayed the peripheral blood transcriptome every 8 hours for 7 days. Of 41 inoculated volunteers, 18 (44%) developed symptomatic infection. Using unbiased sparse latent factor regression analysis, we generated a gene signature (or factor) for symptomatic influenza capable of detecting 94% of infected cases. This gene signature is detectable as early as 29 hours post-exposure and achieves maximal accuracy on average 43 hours (p = 0.003, H1N1) and 38 hours (p-value = 0.005, H3N2) before peak clinical symptoms. In order to test the relevance of these findings in naturally acquired disease, a composite influenza A signature built from these challenge studies was applied to Emergency Department patients where it discriminates between swine-origin influenza A/H1N1 (2009) infected and non-infected individuals with 92% accuracy. The host genomic response to Influenza infection is robust and may provide the means for detection before typical clinical symptoms are apparent.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Lymphomas comprise a diverse group of malignancies derived from immune cells. High throughput sequencing has recently emerged as a powerful and versatile method for analysis of the cancer genome and transcriptome. As these data continue to emerge, the crucial work lies in sorting through the wealth of information to hone in on the critical aspects that will give us a better understanding of biology and new insight for how to treat disease. Finding the important signals within these large data sets is one of the major challenges of next generation sequencing.

In this dissertation, I have developed several complementary strategies to describe the genetic underpinnings of lymphomas. I begin with developing a better method for RNA sequencing that enables strand-specific total RNA sequencing and alternative splicing profiling in the same analysis. I then combine this RNA sequencing technique with whole exome sequencing to better understand the global landscape of aberrations in these diseases. Finally, I use traditional cell and molecular biology techniques to define the consequences of major genetic alterations in lymphoma.

Through this analysis, I find recurrent silencing mutations in the G alpha binding protein GNA13 and associated focal adhesion proteins. I aim to describe how loss-of-function mutations in GNA13 can be oncogenic in the context of germinal center B cell biology. Using in vitro techniques including liquid chromatography-mass spectrometry and knockdown and overexpression of genes in B cell lymphoma cell lines, I determine protein binding partners and downstream effectors of GNA13. I also develop a transgenic mouse model to study the role of GNA13 in the germinal center in vivo to determine effects of GNA13 deletion on germinal center structure and cell migration.

Thus, I have developed complementary approaches that span the spectrum from discovery to context-dependent gene models that afford a better understanding of the biological function of aberrant events and ultimately result in a better understanding of disease.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

The mechanisms involved in the recognition of microbial pathogens and activation of the immune system have been extensively studied. However, the mechanisms involved in the recovery phase of an infection are incompletely characterized at both the cellular and physiological levels. Here, we establish a Caenorhabditis elegans-Salmonella enterica model of acute infection and antibiotic treatment for studying biological changes during the resolution phase of an infection. Using whole genome expression profiles of acutely infected animals, we found that genes that are markers of innate immunity are down-regulated upon recovery, while genes involved in xenobiotic detoxification, redox regulation, and cellular homeostasis are up-regulated. In silico analyses demonstrated that genes altered during recovery from infection were transcriptionally regulated by conserved transcription factors, including GATA/ELT-2, FOXO/DAF-16, and Nrf/SKN-1. Finally, we found that recovery from an acute bacterial infection is dependent on ELT-2 activity.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Single-molecule sequencing instruments can generate multikilobase sequences with the potential to greatly improve genome and transcriptome assembly. However, the error rates of single-molecule reads are high, which has limited their use thus far to resequencing bacteria. To address this limitation, we introduce a correction algorithm and assembly strategy that uses short, high-fidelity sequences to correct the error in single-molecule sequences. We demonstrate the utility of this approach on reads generated by a PacBio RS instrument from phage, prokaryotic and eukaryotic whole genomes, including the previously unsequenced genome of the parrot Melopsittacus undulatus, as well as for RNA-Seq reads of the corn (Zea mays) transcriptome. Our long-read correction achieves >99.9% base-call accuracy, leading to substantially better assemblies than current sequencing strategies: in the best example, the median contig size was quintupled relative to high-coverage, second-generation assemblies. Greater gains are predicted if read lengths continue to increase, including the prospect of single-contig bacterial chromosome assembly.

Relevância:

10.00% 10.00%

Publicador:

Resumo:

Activation of CD4+ T cells results in rapid proliferation and differentiation into effector and regulatory subsets. CD4+ effector T cell (Teff) (Th1 and Th17) and Treg subsets are metabolically distinct, yet the specific metabolic differences that modify T cell populations are uncertain. Here, we evaluated CD4+ T cell populations in murine models and determined that inflammatory Teffs maintain high expression of glycolytic genes and rely on high glycolytic rates, while Tregs are oxidative and require mitochondrial electron transport to proliferate, differentiate, and survive. Metabolic profiling revealed that pyruvate dehydrogenase (PDH) is a key bifurcation point between T cell glycolytic and oxidative metabolism. PDH function is inhibited by PDH kinases (PDHKs). PDHK1 was expressed in Th17 cells, but not Th1 cells, and at low levels in Tregs, and inhibition or knockdown of PDHK1 selectively suppressed Th17 cells and increased Tregs. This alteration in the CD4+ T cell populations was mediated in part through ROS, as N-acetyl cysteine (NAC) treatment restored Th17 cell generation. Moreover, inhibition of PDHK1 modulated immunity and protected animals against experimental autoimmune encephalomyelitis, decreasing Th17 cells and increasing Tregs. Together, these data show that CD4+ subsets utilize and require distinct metabolic programs that can be targeted to control specific T cell populations in autoimmune and inflammatory diseases.